Short: ASCII <-> Entity-Codes or Umlauts
Author: honma@thepentagon.com (Tamio Patrick Honma)
Uploader: honma@thepentagon.com (Tamio Patrick Honma)
Version: 1.6
Type: util/conv
Replaces: util/conv/Raw2Ent*
Files: CWISENV
Raw2Ent 1.6 (06.12.96)
Raw2Ent.doc [this file]
Raw2Ent.rexx 1.4.1 (14.07.96)
Raw2Ent.rexx.old 1.3.4 (01.11.96)
WHAT'S NEW SINCE LAST RELEASE?
CHECK-MODE -> this mode checks whether a file is pure 7-Bit-ASCII
NOTE: Since version 1.5 you can also use Raw2Ent in reverse, as an Ent2Raw!
NOTE: Since version 1.3 you can use Raw2Ent as a "Raw2Uml", which means
that the destination file will contain readable characters, words or
shortcuts. (e.g.: "ü" -> "ue", "£" -> "pound", "©" -> "(C)", etc.)
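The "Raw2Uml" idea can be sketched in a few lines of Python. This is only an
illustration, not the code Raw2Ent uses: the table READABLE and the function
raw_to_readable are hypothetical names, and the table holds just the three
pairs quoted above.

```python
# Hypothetical replacement table. Only these three pairs are taken from
# the Raw2Ent documentation; the real program knows many more.
READABLE = {
    "ü": "ue",
    "£": "pound",
    "©": "(C)",
}


def raw_to_readable(text: str) -> str:
    """Replace each mapped 8-bit character with a 7-bit spelling."""
    # Characters without a table entry are passed through unchanged.
    return "".join(READABLE.get(ch, ch) for ch in text)


print(raw_to_readable("für 5 £ © 1996"))  # fuer 5 pound (C) 1996
```

A plain lookup table is enough here because every replacement depends on a
single input character only.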
INTRODUCTION
Raw2Ent converts raw 8-Bit-ASCII-Text into 7-Bit-ASCII-Text with
entity-codes, and back again. ASCII is a standardized format for
information interchange, but it is only standardized seven bits wide, which
means that only 128 codes are defined. One byte consists of eight bits and can
represent 256 different bit combinations. The upper 128 bit combinations
are therefore left for each operating system to define freely. The problem
is that accented characters and other special characters are not standardized,
because they are defined in (guess where?! ;) ) the free part of ASCII by
the operating system developers.
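A small Python sketch shows the problem. It assumes that Amiga text uses the
ISO 8859-1 (Latin-1) character set and picks the IBM PC code page 437 as a
second system for comparison; both choices are illustrative assumptions, not
something Raw2Ent itself depends on.

```python
# Two bytes: 0x41 lies in the standardized 7-bit area, 0xFC in the free area.
raw = bytes([0x41, 0xFC])

as_latin1 = raw.decode("latin-1")  # assumed Amiga interpretation
as_cp437 = raw.decode("cp437")     # interpretation of an old IBM PC

print(as_latin1)              # Aü
print(as_cp437)               # the "A" survives, the second character differs
print(as_latin1 == as_cp437)  # False
```

The 7-bit byte means the same on both systems, the 8-bit byte does not.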
The goal of the World Wide Web developers was that it could be used on every
important operating system. So it was clear that the ASCII-based
HTML-Source-Code had to use the standardized seven-bit area of the
ASCII-Code. To represent accented characters or other special characters in a
seven-bit-code, it was necessary to invent something: the
entity-code, a kind of escape-code. An entity-code consists of an
introducing "&" and a closing ";". Between these symbols is a
character-name the browser can interpret (for example, "&uuml;" for "ü").
Converting ASCII-Text by hand is hard and tedious work. So just use Raw2Ent!
Raw2Ent produces real 7-Bit-ASCII-Code. All printable Amiga-characters in
the 8-bit-area are converted into entity-codes, without exception.
Names are used instead of code-numbers because they make the entity-codes
easier for humans to read. If no name is defined in the HTML 3.2 reference,
the code-number is used.
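The conversion rule above (name if one exists, code-number otherwise) can be
sketched in Python as follows. This is not Raw2Ent's own code. The function
name raw_to_entities is hypothetical, the input is assumed to be ISO 8859-1
(Latin-1), and the name table comes from Python's html.entities module, which
may differ in detail from the HTML 3.2 table Raw2Ent uses.

```python
import html
from html.entities import codepoint2name


def raw_to_entities(raw: bytes) -> str:
    """Convert 8-bit text (assumed Latin-1) to 7-bit text with entity-codes."""
    out = []
    for ch in raw.decode("latin-1"):
        code = ord(ch)
        if code < 128:
            out.append(ch)                            # 7-bit area: keep as is
        elif code in codepoint2name:
            out.append("&" + codepoint2name[code] + ";")  # named entity-code
        else:
            out.append("&#" + str(code) + ";")        # fall back to the number
    return "".join(out)


encoded = raw_to_entities("für ©".encode("latin-1"))
print(encoded)                 # f&uuml;r &copy;
print(html.unescape(encoded))  # für ©  (the Ent2Raw direction)
```

The sketch leaves every 7-bit character untouched, including "&", and emits a
code-number for the 8-bit codes that have no name in the table.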
You can also use Raw2Ent to check whether your file contains pure seven-bit
codes. If it does not, Raw2Ent reports the positions of the
8-Bit-Characters in the text.
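A check of this kind can be sketched in Python as shown below. How Raw2Ent
reports positions is not described here, so the (line, column, byte) triples
and the function name find_8bit are assumptions made for this example only.

```python
def find_8bit(raw: bytes) -> list:
    """Return a (line, column, byte) triple for every byte above 127."""
    hits = []
    # Lines and columns are counted from 1, as most text editors do.
    for line_no, line in enumerate(raw.split(b"\n"), start=1):
        for col, byte in enumerate(line, start=1):
            if byte >= 0x80:  # high bit set: not 7-Bit-ASCII
                hits.append((line_no, col, byte))
    return hits


print(find_8bit(b"abc\nf\xfcr"))  # [(2, 2, 252)]
print(find_8bit(b"plain text"))   # []
```

An empty result means the file is pure 7-Bit-ASCII.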
SEND COMMENTS TO:
Tamio Patrick Honma
eMail: honma@thepentagon.com
WWW: http://www.netforward.com/thepentagon/?honma